Skip to content

Comments

Fix: CUDA build flags and linker compatibility for Windows and Linux#90

Open
CellularSyntax wants to merge 1 commit intoNX-AI:mainfrom
CellularSyntax:fix-cuda-build
Open

Fix: CUDA build flags and linker compatibility for Windows and Linux#90
CellularSyntax wants to merge 1 commit intoNX-AI:mainfrom
CellularSyntax:fix-cuda-build

Conversation

@CellularSyntax
Copy link

This PR improves the build logic in cuda_init.py to fix platform-specific issues during CUDA extension compilation on Windows.

Fixes:

  • Split -Xptxas flags for Windows to avoid nvcc errors
  • Uses combined flags for Linux where safe
  • Platform-specific handling of extra_ldflags (/LIBPATH vs. -L)

Tested on: Windows 11 (Anaconda, CUDA 12.8, PyTorch 2.4.0)
Not tested on Linux, but preserves the original behavior for Linux systems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant